PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Bostr.20129s0530.1.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Boechereae; Boechera
Family HD-ZIP
Protein Properties Length: 740aa    MW: 82558.1 Da    PI: 6.6072
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Bostr.20129s0530.1.pgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox665.1e-2194149156
                           TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
              Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                           r+k +++t++q++ +e+lF+++++p++++r++L+k+lgL  rqVk+WFqNrR++ k
  Bostr.20129s0530.1.p  94 RKKYHRHTTDQIRHMEALFKETPHPDEKQRQQLSKQLGLAPRQVKFWFQNRRTQIK 149
                           7999************************************************9877 PP

2START2338.2e-732514781206
                           HHHHHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS.......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S.. CS
                 START   1 elaeeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv......dsgealrasgvvdmvlallveellddkeqWdetla.. 77 
                           e+a++a+ el+k+a+++ep+W +s+    e++n+de+l++f+++++      +++ea+r++g+v+m++++l ++++d++ qW+e++a  
  Bostr.20129s0530.1.p 251 EIANRATLELQKMATSGEPLWLRSVetgrEILNYDEYLKEFPQAQAssfpgrKTIEASRDVGIVFMDAHKLAQSFMDVG-QWTEMFAcl 338
                           578999*************************************999*********************************.********* PP

                           ..EEEEEEEECTT.......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--..-TTSEE-EESSEEEEEE CS
                 START  78 ..kaetlevissg.......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppe.sssvvRaellpSgilie 155
                             ka+t++vi++g       ga+qlm+ e+q+l+p+vp R+++fvR++rql+ ++w+ivdvSv++e++++e ++s+ ++++lpSg++ie
  Bostr.20129s0530.1.p 339 isKAATVDVIRQGegpsridGAIQLMFGEMQLLTPVVPtREVYFVRSCRQLSPEKWAIVDVSVSVEDSNTEkEASLLKCRKLPSGCIIE 427
                           ***************************************************************************************** PP

                           EECTCEEEEEEEE-EE--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
                 START 156 pksnghskvtwvehvdlkgrlphwllrslvksglaegaktwvatlqrqcek 206
                           ++snghskvtwveh d+++++++ l+rslv++gla+ga++wvatlq +ce+
  Bostr.20129s0530.1.p 428 DTSNGHSKVTWVEHLDVSASTVQPLFRSLVNTGLAFGARHWVATLQLHCER 478
                           *************************************************97 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
Gene3DG3DSA:1.10.10.601.2E-2279145IPR009057Homeodomain-like
SuperFamilySSF466892.42E-1987152IPR009057Homeodomain-like
PROSITE profilePS5007117.81591151IPR001356Homeobox domain
SMARTSM003899.6E-1893155IPR001356Homeobox domain
PfamPF000462.6E-1894149IPR001356Homeobox domain
CDDcd000862.74E-1698149No hitNo description
PROSITE patternPS000270126149IPR017970Homeobox, conserved site
PROSITE profilePS5084843.52242481IPR002913START domain
SuperFamilySSF559614.53E-31245478No hitNo description
CDDcd088751.53E-109246477No hitNo description
PfamPF018521.0E-66251478IPR002913START domain
SMARTSM002341.8E-83251478IPR002913START domain
Gene3DG3DSA:3.30.530.201.1E-7292477IPR023393START-like domain
SuperFamilySSF559616.46E-16507727No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0009957Biological Processepidermal cell fate specification
GO:0010062Biological Processnegative regulation of trichoblast fate specification
GO:0005634Cellular Componentnucleus
GO:0008289Molecular Functionlipid binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 740 aa     Download sequence    Send to blast
MAVEMSSKQP TKDFFSSPAL SLSLAGIFRN ASSGSTNPEE DFLSRRVVDD EDRTVEMSSE  60
NSGPTRSRSE EDLEGEDHEE EEEEEGNKGN KRKRKKYHRH TTDQIRHMEA LFKETPHPDE  120
KQRQQLSKQL GLAPRQVKFW FQNRRTQIKA IQERHENSLL KAELEKLREE NKAMRESFSK  180
ANSACPNCGG GPDDLHVENS KLKAELDKLR AALGRTPYPL QASCSDDQEH RLGSLDFYTG  240
VFALEKSRIA EIANRATLEL QKMATSGEPL WLRSVETGRE ILNYDEYLKE FPQAQASSFP  300
GRKTIEASRD VGIVFMDAHK LAQSFMDVGQ WTEMFACLIS KAATVDVIRQ GEGPSRIDGA  360
IQLMFGEMQL LTPVVPTREV YFVRSCRQLS PEKWAIVDVS VSVEDSNTEK EASLLKCRKL  420
PSGCIIEDTS NGHSKVTWVE HLDVSASTVQ PLFRSLVNTG LAFGARHWVA TLQLHCERLV  480
FFMATNVPTK DSLGVTTLAG RKSVLKMAQR MTQSFYRAIA ASSYHQWTKI TTKTGQDMRV  540
SSRKNLHDPG EPTGVIVCAS SSLWLPVSPT LLFDFFRDEA RRHEWDALSN GAHVQSIASL  600
SKGQDRGNSV AIQTVKSREK SIWVLQDSCT NSYESVVVYA PVDINTTQLV LAGHDPSNIQ  660
ILPSGFSIIP DGVESRPLVI TTRQDDRNSQ GGSLLTLALQ TLINPSPAAK LNLESVESVT  720
NLISVTLQNI KRSLQIEDC*
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
19195RKRKK
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankAF3602940.0AF360294.1 Arabidopsis thaliana putative homeobox protein GLABRA2 (At1g79840) mRNA, complete cds.
GenBankBT0019560.0BT001956.1 Arabidopsis thaliana clone U09291 putative homeobox protein GLABRA2 (At1g79840) mRNA, complete cds.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqNP_565223.10.0homeobox-leucine zipper protein GLABRA 2
SwissprotP466070.0HGL2_ARATH; Homeobox-leucine zipper protein GLABRA 2
TrEMBLF4HQC00.0F4HQC0_ARATH; Homeobox-leucine zipper protein GLABRA 2
STRINGAT1G79840.20.0(Arabidopsis thaliana)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM123702731
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G79840.10.0HD-ZIP family protein